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Abstract. Current experiments collecting high statistics in ultra- high energy cosmic 
rays (UHECRs) are opening a new window on the universe. In this work we discuss 
a large scale structure model for the UHECR origin which evaluates the expected 
anisotropy in the UHECR arrival distribution starting from a given astronomical 
catalogue of the local universe. The model takes into account the main selection 
effects in the catalogue and the UHECR propagation effects. By applying this method 
to the IRAS PSCz catalogue, we derive the minimum statistics needed to significatively 
reject the hypothesis that UHECRs trace the baryonic distribution in the universe, in 
particular providing a forecast for the Auger experiment. 
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1. Introduction 

Almost a century after the discovery of cosmic rays, a satisfactory explanation of 
their origin is still lacking, the main difficulties being the poor understanding of the 
astrophysical engines and the loss of directional information due to the bending of their 
trajectories in the galactic (GMF) and extragalactic magnetic field (EGMF). 

More in detail, given the few-/zG intensity of regular and turbulent GMF, a diffusive 
confinement of cosmic rays of galactic origin is expected up to rigidity 7Z = pc/Z e ~ 
few x 10 17 V, p being the cosmic ray momentum, Z its charge in units of the positron one, 
and c the speed of light. Still at 1Z ~ few xlO 18 V cosmic rays are strongly deflected, 
and no directional information can be extracted. Around 7Z ~ 10 19 V the regime of 
relatively small deflections in the GMF starts. The transition decades 1Z ~ 10 17 -10 19 V, 
though not yet useful for "directional" astronomy, may still show a rich phenomenology 
(drifts, scintillation, lensing) which is an interesting research topic of its own [1]. 

At energies above a few x 10 19 eV, which we will refer to as the ultra-high energy 
(UHE) regime, protons propagating in the Galaxy retain most of their initial direction. 
Provided that EGMF is negligible, UHE protons will therefore allow to probe into 
the nature and properties of their cosmic sources. However, due to quite steep CR 
power spectrum, UHECRs are extremely rare (a few particles km -2 century -1 ) and 
their detection calls for the prolonged use of instruments with huge collecting areas. 
One further constraint arises from an effect first pointed out by Greisen, Zatsepin and 
Kuzmin [2, 3] and since then known as GZK effect: at energies E > 5xl0 19 eV the 
opacity of the interstellar space to protons drastically increases due to the photo-meson 
interaction process p + 7cmb — * vr°( + ) + p{n) which takes place on cosmic microwave 
background (CMB) photons. In other words, unless the sources are located within a 
sphere with radius of (9(100) Mpc, the proton flux at E > 5xl0 19 eV should be greatly 
suppressed. However, due to the very limited statistics available in the UHE regime (cf. 
Volcano Ranch [4], SUGAR [5], Haverah Park [6, 7], Fly's Eye [8, 9, 10], Yakutsk [11] 
AGASA [12], HiRes [13, 14], and, very recently, also Auger [15]), the experimental 
detection of the GZK effect has not yet been firmly established. 

It has to be stressed that the theoretical tools available to probe this extremely 
interesting part of the CRs spectrum are still largely inadequate: both the modelling and 
the data interpretation impose either strong assumptions based on little experimental 
evidence or the extrapolation by orders of magnitudes of available knowledge. For 
instance, the structure and magnitude of the EGMF are poorly known. Only recently, 
magnetic fields were included in simulations of large scale structures (LSS) [16, 17]. 
Qualitatively the simulations agree in finding that EGMFs are mainly localized in galaxy 
clusters and filaments, while voids should contain only primordial fields. However, the 
conclusions of Refs. [16] and [17] are quantitatively rather different and it is at present 
unclear whether deflections in extragalactic magnetic fields will prevent astronomy even 
with UHE protons or not. Another large source of uncertainty is our ignorance on the 
chemical composition of UHECRs, mainly due to the need to extrapolate for decades in 
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energy the models of hadronic interactions. They are an essential input for the Monte 
Carlo simulations used in the analysis and reconstruction of UHECRs showers, but the 
predictions of such simulations differ appreciably already in the knee region (around 
10 15 eV), even when high quality data and deconvolution techniques are used [18]. 
Future accelerator measurements of hadronic cross sections in higher energy ranges will 
ameliorate the situation, but this will take several years at least. 

From now on, therefore, we shall work under the assumptions that UHE astronomy 
is possible, namely: i) proton primaries, for which eTZ = E; ii) EGMF negligibly small; 
hi) extragalactic astrophysical sources are responsible for UHECR acceleration. Now 
the question arises: might one support this scenario using the directional information 
in UHECRs? A possibility favoring these hypothesis is that relatively few, powerful 
nearby sources are responsible for the UHECRs, and the small scale clustering observed 
by AGASA [19] may be a hint in this direction. However, the above quoted clustering has 
not yet been confirmed by other experiments with comparable or larger statistics [20, 21], 
and probably a final answer will come when the Pierre Auger Observatory [22] will 
have collected enough data. Independently on the observation of small-scale clustering, 
one could still look for large scale anisotropies in the data, eventually correlating with 
some known configuration of astrophysical source candidates. In this context, the most 
natural scenario to be tested is that UHECRs correlate with the luminous matter in 
the "local" universe. This is particularly expected for candidates like gamma ray bursts 
(hosted more likely in star formation regions) or colliding galaxies, but it is also a 
sufficiently generic hypothesis to deserve an interest of its own. 

Aims of this work are: i) to describe a method to evaluate the expected anisotropy 
in the UHECR sky starting from a given catalogue of the local universe, taking into 
account the selection function, the blind regions as well as the energy-loss effects; ii) 
to assess the minimum statistics needed to significatively reject the null hypothesis, in 
particular providing a forecast for the Auger experiment. Previous attempts to address 
a similar issue can be found in [23, 24, 25, 26]. Later in the paper we will come back to 
a comparison with their approaches and results. 

The catalogue we use is IRAS PSCz [27]. This has several limitations, mainly due 
to its intrinsic incompleteness, but it is good enough to illustrate the main features of the 
issue, while still providing some meaningful information. This work has to be intended 
as mainly methodological. An extension to the much more detailed 2MASS [28, 29] and 
SDSS [30, 31] galaxy catalogues is presently investigated. 

The paper is structured as follows: the catalogue and the related issues are discussed 
in Section 2. In Section 3 we describe the technique used for our analysis. The results are 
discussed in Section 4, where we compare our findings with those obtained in previous 
works. In Section 5 we give a brief overview on ongoing research and experimental 
activities, and draw our conclusions. Throughout the paper we work in natural units 
h = k B = c = 1, though the numerical values are quoted in the physically most suitable 
units. 
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Figure 1. PSCz catalogue source distribution and related mask in galactic 
coordinates. 

2. Astronomical Data 

2.1. The Catalogue 

Two properties are required to make a galaxy catalogue suitable for the type of analysis 
discussed here. First, a great sky coverage is critical for comparing the predictions 
with the fraction of sky observed by the UHECR experiments (the Auger experiment 
is observing all the Southern hemisphere and part of the Northern one). Second, the 
energy-loss effect in UHECR propagation requires a knowledge of the redshifts for at 
least a fair subsample of the galaxies in the catalogue. Selection effects both in fluxes 
and in redshifts play a crucial role in understanding the final outcome of the simulations. 

Unfortunately, in practical terms this two requirements turn out to be almost 
complementary and no available catalogue matches both needs simultaneously. A fair 
compromise is offered by the IRAS PSCz catalogue [27] which contains about 15 000 
galaxies and related redshifts with a well understood completeness function down to 
z ~ 0.1 — i.e. down to a redshift which is comparable to the attenuation length 
introduced by the GZK effect — and a sky coverage of about 84%. The incomplete 
sky coverage is mainly due to the so called zone of avoidance centered on the Galactic 
Plane and caused by the galactic extinction and to a few, narrow stripes which were 
not observed with enough sensitivity by the IRAS satellite (see Fig. 1). These regions 
are excluded from our analysis with the use of the binary mask available with the PSCz 
catalogue itself. 

2.2. The Selection Function 

No available galaxy catalogue is complete in volume and therefore completeness 
estimates derived from the selection effects in flux are needed. More in detail, the 
relevant quantity to be derived is the fraction of galaxies actually observed at the various 
redshifts, a quantity also known as the redshift selection function <fr(z) [32]. A convenient 
way to express <f>(z) is in terms of the galaxy luminosity function (i.e. the distribution 
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of galaxy luminosities) <&(L) as 

JV°° M dL$(L) 

/AM = ^!iki Lji l) 

0[) SS°dL*(L) ■ [i) 

Here L m[n (z) is the minimum luminosity detected by the survey in function of redshift. 
By definition, for a flux-limited survey of limiting flux fu m , L m ; n (z) is given in terms of 
the luminosity distance cLl{z) as 

L m i n {z) = 4ird 2 L (z) /u m . (2) 

The luminosity distance depends on the cosmology assumed, though for small redshifts 
(z £ 0.1) it can be approximated by d^z) ~ z/H . 

Generally <fi(z) is inferred from the catalogue data itself in a self-consistent way, 
using the observational galaxy luminosity distribution to estimate 3>(L) [27, 33, 34]. The 
quantity n(z)/4>(z) represents the experimental distribution corrected for the selection 
effects, which must be used in the computations. A detailed discussion of this issue 
can be found in Ref. [35]. Furthermore, we wish to stress that up to z ~ 0.1 evolution 
effects are negligible and the local universe galaxy luminosity function can be safely used. 
In the case of deeper surveys like SDSS, cosmological effects cannot be neglected and 
our approach can still be employed even though a series of corrections, like evolutionary 
effects or scale-dependent luminosity, must be taken into account [36] . These corrections 
are needed since luminous galaxies, which dominate the sample at large scales, cluster 
more than faint ones [37]. In the case of the PSCz catalogue the selection function is 
given as [27] 

l-ar /.x 71 -(f) 



1 



(3) 



with the parameters 0* = 0.0077, a = 1.82, r* = 86.4, 7 = 1.56, (3 = 4.43 that 
respectively describe the normalization, the nearby slope, the break distance in h~ 1 Mpc, 
its sharpness and the additional slope beyond the break (see also Fig. 2). 

It is clear, however, that even taking into account the selection function we cannot 
use the catalogue up to the highest redshifts (z — 0.3), due to the rapid loss of 
statistics. At high z, in fact, the intrinsic statistical fluctuation due to the selection effect 
starts to dominate over the true matter fluctuations, producing artificial clusterings not 
corresponding to real structures ("shot noise" effect). This problem is generally treated 
constructing from the point sources catalogue a smoothed density field p(Cl, z) with a 
variable smoothing length that effectively increases with redshift, remaining always of 
size comparable to the mean distance on the sphere of the sources of the catalogue. We 
minimize this effect by being conservative in setting the maximum redshift at z = 0.06 
(corresponding to 180 /i _1 Mpc) where we have still good statistics while keeping the shot 
noise effect under control. With this threshold we are left with ~ 11, 500 sources of the 
catalogue. Furthermore, for the purposes of present analysis, the weight of the sources 
rapidly decreases with redshift due to the energy losses induced by the GZK effect. In 
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Figure 2. Experimental redshift distribution of the PSCz catalogue galaxies and 
prediction for an homogeneous universe from the selection function 4>(z) (from [27]); 
both are normalized in order to represent the number of sources per unit of redshift 
per steradian. 

the energy range E > 5xl0 19 eV, the contribution from sources beyond z ~ 0.06 is sub- 
dominant, thus allowing to assume for the objects beyond z = 0.06 an effective isotropic 
source contribution. 

3. The Formalism 

In the following we describe in some detail the steps involved in our formalism. In 
Sec. 3.1 we summarize our treatment for energy losses, in Sec. 3.2 the way the "effective" 
UHECR map is constructed, and in Sec. 3.3 the statistical analysis we perform. 

3.1. UHECRs Propagation 

The first goal of our analysis is to obtain the underlying probability distribution 
fhss{&-,E) to have a UHECR with energy higher than E from the direction Cl. For 
simplicity here and throughout the paper we shall assume that each source of our 
catalogue has the same probability to emit a UHECR, according to some spectrum 
at the source g{Ej). In principle, one would expect some correlation of this probability 
with one or more properties of the source, like its star formation rate, radio-emission, 
size, etc. The authors of Ref. [26] tested for a correlation Ltjhecr ^ -^fir) -^uhecr being 
the luminosity in UHECRs and Lfir the one in far-infrared region probed in IRAS 
catalogue. The results of their analysis do not change appreciably as long as ,5 k ^ 1. 
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We can then expect that our limit of k = might well work for a broader range in 
parameter space, but this is not of much concern here, since we do not stick to specific 
models for UHECR sources. The method we discuss can be however easily generalized 
to such a case, and eventually also to a multi-parametric modelling of the correlation. 

In an ideal world where a volume-complete catalogue were available and no energy 
losses for UHECRs were present, each source should then be simply weighted by the 
geometrical flux suppression oc dj 2 . The selection function already implies the change 
of the weight into <p~ l d1 2 . Moreover, while propagating to us, high-energy protons lose 
energy as a result of the cosmological redshift and of the production of pairs and 
pions (the dominant process) caused by interactions with CMB. For simplicity, we shall 
work in the continuous loss approximation [38]. Then, a proton of energy E$ at the 
source at z — z,- L will be degraded at the Earth (z = 0) to an energy Ef given by the 
energy-loss equation| 

\ dE dt . n n n . 

Eq.(4) has to be integrated from Zj, where the initial Cauchy condition E(z = Zj) = Ei 
is imposed, to z = 0. The different terms in Eq. (4) are explicitly shown below 
dt 



= [(1 + z)H^{l + z)m M + Q A }~\ (5) 

p rsh (z) = H oy J(i + z)3n M + n A , (6) 

(3 n (z,E) ~a(l + ^) 3 , E>E match (7) 
A n (l + zfe'W , E<E match (8) 

where we assume for the Hubble constant H = 7li 3 km/s/Mpc, and Qm — 0.27 
and Q\ ~ 0.73 are the matter and cosmological constant densities in terms of the 
critical one [39]. In the previous formulae, m e and m p are respectively the electron 
and proton masses, To is the CMB temperature, and a the fine-structure constant. 
Since we are probing the relatively near universe, the results will not depend much 
from the cosmological model adopted, but mainly on the value assumed for Hq. More 
quantitatively, the r.h.s of Eq. (4) changes linearly with Hq 1 (apart for the negligible 
term (3 rs ^), while even an extreme change from the model (Qm = 0.27; Q\ = 0.73) to 
(fljvf = 1; = 0.0) (the latter ruled out by present data) would only modify the energy 
loss term by 6% at z ~ 0.06, the highest redshift we consider. 

The parameterization for (3 n as well as the values {A n , B n , C n } = 
{3.66xl0~ 8 yr" 1 ,2.87xl0 20 eV,2.42xl0~V" 1 } are taken from [40], and E match (z) = 
6.86 e~ om7z x 10 20 eV is used to ensure continuity to j3 n (z,E). An useful parameteriza- 
tion of the auxiliary function <p(£) can be found in [41], which we follow for the treatment 

| We are neglecting diffuse backgrounds other than CMB and assuming straight-line trajectories, 
consistently with the hypothesis of weak EGMF. 
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of the pair production energy loss. In practice, we have evolved cosmic rays over a log- 
arithmic grid in Ej_ from 10 19 to 10 23 eV, and in z from 0.001 to 0.3. The values at a 
specific source site has been obtained by a smooth interpolation. 

Note that in our calculation i) the propagation is performed to attribute an "energy- 
loss weight" to each z in order to derive a realistic probability distribution /lss(^, 
ii) we are going to "smooth" the results over regions of several degrees in the sky (see 
below), thus performing a sort of weighted average over redshifts as well. Since this 
smoothing effect is by far dominant over the single source stochastic fluctuation induced 
by pion production, the average effect accounted for by using a continuous energy-loss 
approach is a suitable approximation. 

In summary, the propagation effects provide us a "final energy function" Ef{E^ z) 
giving the energy at Earth for a particle injected with energy Ei at a redshift z. Note 
that, being the energy-loss process obviously monotone, the inverse function Ei(Ef,z) 
is also available. 



3.2. Map Making 

Given an arbitrary injection spectrum g(Ei), the observed events at the Earth 
would distribute, apart for a normalization factor, according to the spectrum 
g(Ei(Ef, z))dEi/dEf. In particular we will consider in the following a typical power-law 
g(Ei) oc E~ s , but this assumption may be easily generalized. Summing up on all the 
sources in the catalogue one obtains the expected differential flux map on Earth 

E f) « Yl ^I^to. *)g *>• w 

where the selection function and distance flux suppression factors have been taken into 
account. However, given the low statistics of events available at this high energies, a 
more useful quantity to employ is the integrated flux above some energy threshold E cut , 
that can be more easily compared with the integrated UHECR flux above the cut E cui . 
Integrating the previous expression we have 

f LSS (Q, E cut ) oc £ 1 ~ "* } I" E~ s dE 

= E/lssW S(Cl-Cl k ), (11) 

k 

that can be effectively seen as if at every source k of the catalogue it is assigned a 
weight fhss(k) that takes into account geometrical effects (d~[ 2 ), selection effects (0~ 1 ), 
and physics of energy losses through the integral in dE. In this "GZK integral" the upper 
limit of integration is taken to be infinite, though the result is practically independent 
from the upper cut used provided it is much larger than 10 20 eV. 

It is interesting to compare the similar result expected for an uniform source 
distribution with constant density; in this case we have (in the limit K<1) 

/ LSS (Q, E cut ) oc J dz [Ei{Ec ^f S+ = J dz P ( Zj E cnu 8 ), (12) 
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where the integral in dE has been explicitly performed and the flux suppression weight 
is cancelled by the geometrical volume factor. The integrand p(z, E cut , s) containing 
the details of the energy losses also provides an effective cut at high z. The integrand 
— when normalized to have unit area — can be interpreted as the distribution of the 
injection distances of CR observed at the Earth. It also suggests the definition of 
the so-called "GZK sphere" as the sphere from which originates most (say 99%) of 
the observed CR flux on Earth above an energy threshold E cnt . In Fig. 3 we plot 
the distribution p for different values of E cut and s. We see that around a particular 
threshold zqzk the distribution falls to zero: the dependence of ^gzk on E cut is quite 
critical as expected, while there is also a softer dependence on s. This suggests naturally 
the choice E cut = 5xl0 19 eV for the chosen value z GZ k — 0.06; at the same time, the 
energy cut chosen is not too restrictive, ensuring indeed that a significant statistics 
might be achieved in a few years. For this E cut the isotropic contribution to the flux 
is sub-dominant; however we can take it exactly into account and the weight of the 
isotropic part is given by§ 



w iso oc / dzp(z,E cut ). (13) 

Finally, to represent graphically the result, the spike-like map (11) is effectively 
smoothed through a gaussian filter as 

/ LSS (fi, E cut ) oc £ / LSS (fc) exp (-^^) + ^2^(0). (14) 

In the previous equation, a is the width of the gaussian filter, d s is the spherical distance 
between the coordinates fl and flk, and is the catalogue mask (see Section 2.1) 
such that /i(fi) = if Cl belongs to the mask region and = 1 otherwise. 



3.3. Statistical Analysis 

Given the extremely poor UHECR statistics, we limit ourselves to address the basic 
issue of determining the minimum number of events needed to significatively reject "the 
null hypothesis" . To this purpose, it is well known that a x 2 -test is an extremely good 
estimator. Notice that a x 2_ test needs a binning of the events, but differently from 
the K-S test performed in [26] or the Smirnov-Cramer-von Mises test of [25], it has no 
ambiguity due to the 2-dimensional nature of the problem, and indeed a similar approach 
was used in [23]. A criterion guiding in the choice of the bin size is the following: with 
iV UHECRs events available and M bins, one would expect 0(N/M) events per bin; 
to allow a reliable application of the x 2_ t es t, one has to impose N/M > 10. Each cell 
should then cover at least a solid angle of Am ~ 10 x A tot /iV, A tot being the solid angle 
accessible to the experiment. For A tot ~ 2n (50% of full sky coverage), one estimates 
a square window of side A5A°/\/ r N, i.e. 45° for 100 events, 14° for 1000 events. Since 
the former number is of the order of present world statistics, and the latter is the 

§ The normalization factor is fixed consistently with Eqs. (11)-(12). 
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Figure 3. Distribution of the injection distances of CR observed at the Earth for fixed 
E cut = 5xl0 19 eV (top) and s = 1.5,2.0,2.5,3.0 and for fixed spectral index s = 2.0 
(bottom) and varying E cut = 3, 5, 7, 9xl0 19 eV. The area subtended by p(z) has been 
normalized to unity. 

achievement expected by Auger in several years of operations, a binning in windows 
of size 15° represents quite a reasonable choice for our forecast. This choice is also 
suggested by the typical size of the observable structures, a point we will comment 
further at the end of this Section. Notice that the GMF, that induces at these energies 
typical deflections of about 4° [43] , can be safely neglected for this kind of analysis. The 
same remark holds for the angular resolution of the experiment. 

Obviously, for a specific experimental set-up one must include the proper exposure 
u cxp , to convolve with the previously found /lss- The function u; CX p depends on the 
declination 5, right ascension RA, and, in general, also on the energy. For observations 
having uniform coverage in RA, like AGASA or Auger ground based arrays, one can 
easily parameterize the relative exposure as [42] 

^ex P (<5) oc cos #0 sin a m cos S + a m sin #0 sin 5, (15) 
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Figure 4. Galactic coordinate reference frame and contours enclosing 68%, 95% and 
99% of the Auger exposure function, with the corresponding declinations. The celestial 
equator (5 = 0°) and south pole (S = —90°) are also shown. 



where 6q is the latitude of the experiment (6*0 ~ —35° for Auger South), a m is given by 

[ 0, if f >1 

a m = I 7T, if £ < -1 (16) 

I cos^ 1 £ , otherwise 

and 

_ cos fl max - sin 6 sin 5 . . 

cos 6*o cos o 

^max being the maximal zenith angle cut applied (we assume # max = 60° for Auger). 
Contour plots for the Auger exposure function in galactic coordinates are shown in 
Fig. 4. 

For a given experiment and catalogue, the null hypothesis we want to test is that 
the events observed are sampled — apart from a trivial geometrical factor — according 
to the distribution /lss ^cx P A*- Since we are performing a forecast analysis, we will 
consider test realizations of N events sampled according to a random distribution on 
the (accessible) sphere, i.e. according to u; exp /i, and determine the confidence level 
(C.L.) with which the hypothesis is rejected as a function of N. For each realization of 
N events we calculate the two functions 

•^^i^f 1 - (18) 



iso v 



M f„ r \ £ 1\2 
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where Oj is the number of "random" counts in the i-th bin fij, and cJ/lss] an d e i[/iso] are 
the theoretically expected number of events in respectively for the LSS and isotropic 
distribution. In formulae (see Eq. (11)), 



subtended by the angular bin and similarly S u = f^dVLuj exp ^i. The mock data 
set is then sampled M times in order to establish empirically the distributions of A'lss 
and X? Q , and the resulting distribution is studied as function of N (plus eventually 
s,E mt , etc.). The parameter 



is a mask-correction factor that takes into account the number of points belonging to 
the mask region and excluded from the counts Oj. Note that the random distribution is 
generated with N events in all the sky view of the experiment, but, effectively, only the 
region outside the mask is included in the statistical analysis leaving us with effective 
Na events to study. This is a limiting factor due to quality of the catalogue: With a 
better sky coverage the statistics is improved and the number of events required to asses 
the model can be reduced. 

As our last point, we return to the problem of choice of the bin size. To assess 
its importance we studied the dependence of the results on this parameter. For a cell 
side larger than about ~ 25° the analysis loses much of its power, and a very high 
N is required to distinguish the models and obtain meaningful conclusions. This is 
somewhat expected looking at the map results that we obtain, where typical structures 
have dimensions of the order 15° — 20°. A greater cell size results effectively in a too 
large smoothing and a consequent lost of information. On the other hand, a cell size 
below 4° — 6° makes the use of a x 2 analysis not very reliable, because of the low number 
of events in each bin expected for realistic exposure times. In the quite large interval 
~ 6° — 20° for the choice of the cell size, however, the result is almost independent of 
the bin size, that makes us confident on the reliability of our conclusions. 

4. Results 

In Fig. 5 we plot the smoothed maps in galactic coordinates of the expected integrated 
flux of UHECRs above the energy threshold E cut = 3, 5, 7, 9xl0 19 eV and for slope 
parameter s = 2.0; the isotropic part has been taken into account and the ratio of the 
isotropic to anisotropic part w- mo / J2k /lss(^) is respectively 83%, 3.6%, <C 1%, <C 1%. 

Only for E cut = 3xl0 19 eV the isotropic background constitutes then a relevant 
fraction, since the GZK suppression of far sources is not yet present. For the case 




a = 



JdQ u exp (S) 



(22) 




Figure 5. Equal area Hammer- Aitoff projections of the smoothed UHECRs arrival 
directions distribution (Eq. (14)) in galactic coordinates obtained for fixed s = 2.0 and, 
from the upper to the lower panel, for E cut = 3, 5, 7, 9xl0 19 eV. The smoothing angle 
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Figure 6. Detailed key of the structures visible in the UHECR maps; arbitrary contour 
levels. Labels correspond to: (1) Southern extension of Virgo and Local Supercluster; 
(2)Fornax-Eridani Cluster; (3) Cassiopea Cluster; (4) Puppis Cluster; (5) Ursa Major 
Cloud; (6-7) Pavo-Indus and "Great Attractor" region; (8) Centaurus Super-Cluster; 
(9) Hydra Super-Cluster; (10) Perseus Super-Cluster; (11) Abell 569; (12) Pegasus 
Cluster; (13-17) Pisces Cluster; (14) Abell 634; (15) Coma Cluster; (16-18) Hercules 
Supercluster; (19) Leo Supercluster; (20) Columba Cluster; (21) Cetus Cluster; (22) 
Shapley Concentration; (23) Ursa Major Supercluster; (24) Sculptor Supercluster; (25) 
Bootes Supercluster. 

of interest E cnt = 5xl0 19 eV the contribution of w- lso is almost negligible, while it 
practically disappears for E cut > 7xl0 19 eV. Varying the slope for s = 1.5,2.0,2.5,3.0 
while keeping E cnt = 5xl0 19 eV fixed produces respectively the relative weights 
8.0%, 3.6%, 1.8%, 0.9%, so that only for very hard spectra Wi SO would play a non- 
negligible role (see also Fig. 3). 

Due to the GZK-effect, as it was expected, the nearest structures are also the most 
prominent features in the maps. The most relevant structure present in every slide 
is the Local Supercluster. It extends along I ~ 140° and / ~ 300° and includes the 
Virgo cluster at I = 284°, b = +75° and the Ursa Major cloud at I = 145°, b = +65°, 
both located at z ~ 0.01. The lack of structures at latitudes from I ~ 0° to I ~ 120° 
corresponds to the Local Void. At higher redshifts the main contributions come from 
the Perseus-Pisces supercluster (/ = 160°, b = —20°) and the Pavo-Indus supercluster 
(I = 340°, b = —40°), both at z ~ 0.02, and the very massive Shapley Concentration 
(/ = 250°, b = +20°) at z ~ 0.05. For a more detailed list of features in the map, see 
the key in Fig. 6. 

The E'cut-dependence is clearly evident in the maps: as expected, increasing E cut 
results in a map that closely reflects the very local universe (up to z ~ 0.03 — 0.04) 
and its large anisotropy; conversely, for E cut ~ 3,4xl0 19 eV, the resulting flux is quite 
isotropic and the structures emerge as fluctuations from a background, since the GZK 
suppression is not yet effective. This can be seen also comparing the near structures with 
the most distant ones in the catalogue: while the Local Supercluster is well visible in all 
slides, the signal from the Perseus-Pisces super-cluster and the Shapley concentration 
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Table 1. The probability (in %) to reject the isotropic hypothesis at (90%:99%) C.L. 
when UHECRs follow the LSS distribution, as a function of the injection spectral index 
and of the observed number of events, fixing E cut = 5xl0 19 eV. 

is of comparable intensity only in the two top panels, while becoming highly attenuated 
for E cut = 7xl0 19 eV, and almost vanishing for E cut = 9xl0 19 eV. A similar trend is 
observed for increasing s at fixed -Ecut, though the dependence is almost one order of 
magnitude weaker. Looking at the contour levels in the maps we can have a precise 
idea of the absolute intensity of the "fluctuations" induced by the LSS; in particular, 
for the case of interest of E cut = 5xl0 19 eV the structures emerge only at the level of 
20%-30% of the total flux, the 68% of the flux actually enclosing almost all the sky. 
For E cnt = 7, 9xl0 19 eV, on the contrary, the local structures are significantly more 
pronounced, but in this case we have to face with the low statistics available at this 
energies. Then in a low-statistics regime it's not an easy task to disentangle the LSS 
and the isotropic distributions. 

The structures which are more likely to be detected by Auger (see also Fig. 4) 
are the Shapley concentration, the Southern extension of the Virgo cluster, the Local 
Supercluster and the Pavo-Indus super-cluster. Other structures, such as the Perseus- 
Pisces supercluster and the full Virgo cluster are visible only from the Northern 
hemisphere and are therefore within the reach of experiments like Telescope Array [44], 
or the planned North extension of the Pierre Auger Observatory. Moreover, the sky 
region obscured by the heavy extinction in the direction of the Galactic Plane reflects a 
lack of information about features possibly "hidden" there. Unfortunately, this region 
falls just in the middle of the Auger field of view, thus reducing — for a given statistics 
N — the significance of the check of the null hypothesis. Numerically, this translates 
into a smaller value of the factor a of Eq. (22) with respect to an hypothetical "twin" 
Northern Auger experiment. 

A quantitative statistical analysis confirms previous qualitative considerations. In 
Table 1 we report the probability to reject the isotropic hypothesis at 90% and 99% 
C.L. when UHECRs follow the LSS distribution, as a function of the injection spectral 
index and of the observed number of events, fixing E cut = 5xl0 19 eV. In Figure 7 
we show the distributions of the functions X? and Xl ss introduced in the previous 
section for s = 2.0,3.0 and N = 200,1000, for the same cut E cut = 5xl0 19 eV. It 
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Figure 7. The probability distributions of the estimators X? a and X^ ss for the cases 
s = 2.0,3.0 and for N = 200, 1000 events, fixing E cut = 5xl0 19 cV. The distribution 
are the results of 10000 monte-carlo simulation like described in the text. 

is clear that a few hundreds events are hardly enough to reliably distinguish the two 
models, while N = 800-1000 should be more than enough to reject the hypothesis at 
2-3 a, independently of the injection spectrum. Steeper spectra however slightly reduce 
the number of events needed for a given CL. discrimination. It is also interesting to 
note that, using different techniques and unconstrained LSS simulations, it was found 
that a comparable statistics is needed to probe a magnetized local universe [17]. It is 
worthwhile stressing that our conclusions should be looked as conservative, since only 
proton primaries have been assumed, and constant source properties. Variations in 
individual source power and a mixed composition could increase the "cosmic variance" 
and make more difficult to distinguish among models for the source distribution [17]. 

With respect to previous literature on the subject, our analysis is the closest to the 
one of Ref . [23] . Apart for technical details, the greatest differences with respect to this 
work arise because of the improved determination of crucial parameters undergone in 
the last decade. Just to mention a few, the Hubble constant used in [23] was 100 km 
s" 1 Mpc -1 , against the presently determined value of 711^ km s _1 Mpc -1 : this changes 
by a 30% the value of the quantity zqzk (see Sec. 3.2). Moreover, the catalogue [45] 
that was used in [23] contains about 1/3 of the objects we are considering, has looser 
selection criteria and larger contaminations [27]. Finally, the specific location of the 
Southern Auger observatory was not taken into account. All together, when considering 
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these factors, we find quite good agreement with their results. 

Some discrepancy arises instead with the results of [26], whose maps appear to be 
dominated by statistical fluctuations, which mostly wash away physical structures. This 
has probably to be ascribed to two effects, the energy cut E cut = 4xl0 19 eV and the 
inclusion of high redshift object (up to z ~ 0.3) of the catalogue [27] in their analysis. 
Their choice of E cnt = 4xl0 19 eV implies indeed zqzk — 0.1, i.e. a cutoff in a redshift 
range where shot noise distortions are no longer negligible. The same remarks hold 
for Ref. [25], which also suffers of other missing corrections [26]. Also, in both cases, 
the emphasis is mainly in the analysis of the already existing AGASA data than in a 
forecast study. Our results however clearly show that AGASA statistics — only 32 data 
at E > 5xl0 19 eV in the published data set [46], some of which falling inside the mask — 
is too limited to draw any firm conclusion on the hypothesis considered. 

5. Summary and conclusion 

In this work we have summarized the technical steps needed to properly evaluate the 
expected anisotropy in the UHECR sky starting from a given catalogue of the local 
universe, taking into account the selection function, the blind regions, and the energy- 
loss effects. By applying this method to the catalogue [27], we have established the 
minimum statistics needed to significatively reject the null hypothesis, in particular 
providing a forecast for the Auger experiment. We showed with a \ 2 approach that 
several hundreds data are required to start testing the model at Auger South. The most 
prominent structures eventually "visible" for this experiment were also identified. 

Differently from other statistical tools based e.g. on auto-correlation analysis, the 
approach sketched above requires an Ansatz on the source candidates. The distribution 
of the luminous baryonic matter considered here can be thought as a quite generic 
expectation deserving interest of its own, but it is also expected to correlate with 
many sources proposed in the literature. In any case, if many astrophysical sources 
are involved in UHECR production, it is likely that they should better correlate with 
the local baryonic matter distribution than with an isotropic background. 

As already stated, this work has to be intended as mainly methodological. 
Until now, the lack of UHECR statistics and the inadequacy of the astronomical 
catalogues has seriously limited the usefulness of such a kind of analysis. However, 
progresses are expected in both directions in forthcoming years. From the point of 
view of UHECR observatories, the Southern site of Auger is almost completed, and 
already taking data. Working from January 2004 to June 2005, Auger has reached a 
cumulative exposure of 1750 km 2 sr yr, observing 10 events over 10 1 9 eV=5xl0 19 eV 
(see the URL: www.auger.org/icrc2005/spectrum.html), Notice that statistical and 
systematic errors are still quite large, and a down-shift in the log 10 £' scale of 0.1 would 
for example change the previous figure to 17 events. Once completed, the total area 
covered will be of 3000 km 2 , thus improving by one order of magnitude present statistics 
in a couple of years [47]. The idea to build a Northern Auger site strongly depends on 
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the possibility to perform UHECR astronomy, for which full sky coverage is of primary 
importance. In any case, the Japanese- American Telescope Array in the desert of Utah 
is expected to become operational by 2007 [48]. It should offer almost an order of 
magnitude larger aperture per year than AGASA in the Northern sky, with a better 
control over the systematics thanks to a hybrid technique similar to the one employed 
in Auger. 

The other big step is expected in astronomical catalogues. The 2MASS survey [28] 
has resolved more than 1.5 million galaxies in the near-infrared, and has been explicitly 
designed to provide an accurate photometric and astrometric knowledge of the nearby 
Universe. The observation in the near IR is particularly sensitive to the stellar 
component, and as a consequence to the luminous baryons. Though the redshifts 
of the sources have to be obtained via photometric methods, the larger error on the 
distance estimates (about 20% from the 3-band 2MASS photometry [29]) is more 
than compensated by the larger statistics. An analysis of this catalogue for UHECR 
purposes is in progress. Independently of large sky coverage, deep surveys like SDSS [31] 
undoubtedly have an important role in mapping the local universe as well. For example, 
the information encoded in such catalogues can be used to validate methods — like the 
neural networks [49, 50, 51] — used to obtain photometric redshifts. An even better 
situation is expected from future projects like SDSS II (see the URL: www.sdss.org). 
Finally, a by-product of these surveys is the discovery and characterization of active 
galactic nuclei [52, 53], which in turn could have interesting applications in the search 
for the sources of UHECRs. 
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